韩国专利KR20030034274A Generalized photometric stereo for the hybrid reflectance model using an image sequences

专利PDF首页>>韩国专利

专利附录

专利说明

权利要求

类似技术

同族专利

引用文献

法律状态

优先权

专利摘要:
PURPOSE: A photo metric stereo method in a hybrid reflectance model is provided to estimate geometrical information such as surface albedo of an object, hybrid reflectance model parameters and surface normal vector from serial images to restore three-dimensional depth information of the object. CONSTITUTION: N images having different light source locations are received(S1). All of image pairs capable of being used for estimating image formation parameters are selected from the images, and the image formation parameters are repeatedly estimated from the image pairs(S2). Albedo and normal with respect to a diffused reflection component, which minimize diffused reflection error, are predicted from the selected image pairs(S3). A diffused reflection image is detected from the normal vector and diffused reflection albedo related with the diffused reflection component(S5). A total reflection image is detected from the diffused reflection image and entire image(S6). Image formation parameters related with total reflection are detected from the total reflection image(S7).
公开号:KR20030034274A
申请号:KR1020010056734
申请日:2001-09-14
公开日:2003-05-09
发明作者:박보건；윤일동；이경무；이상욱
申请人:박보건；윤일동；이경무；이상욱；
IPC主号:

专利说明:

GENERALIZED PHOTOMETRIC STEREO FOR THE HYBRID REFLECTANCE MODEL USING AN IMAGE SEQUENCES}
[14] The present invention relates to the field of computer vision for restoring the three-dimensional shape and physical properties of an object, and more particularly to a generalized photometric stereo method in a hybrid reflection model that synthesizes three-dimensional information of an object and an image from a light source. It is about.
[15] Analyzing the brightness value of the input image to restore the three-dimensional shape and physical characteristics of the object is one of the most important problems in the field of computer vision. Since the brightness value of the image is greatly influenced by the surface reflection characteristics of the object, various surface reflection models have been studied. The most common of these is the Lambertian model. However, this model only reflects the diffuse reflection component, so it is difficult to apply to the actual surface where the total reflection component exists.
[16] To overcome this problem, various hybrid reflection models have been proposed. Although the derivation process and the basic assumptions of these models are different from each other, most models separate the reflection component of the surface into total reflection component and diffuse reflection component. Based on this, a study was conducted to separate input images into diffuse reflection images and total reflection images. Coleman and Jain proposed an algorithm that extracts an image with one total reflection component for each pixel from four input images. Shashua also used a similar algorithm to detect total reflection components from the input image. However, this method has a big problem when the total surface reflection component exists in more than one sheet because the surface of the object is rough and generally does not work effectively for N images.
[17] SUMMARY OF THE INVENTION The present invention has been made to solve this problem, and an object of the present invention is to estimate geometric information such as surface albedo and hybrid reflection model variables and surface normal vector from an N-field serial image It is to provide a generalized photometric stereo method in a hybrid reflection model that reconstructs three-dimensional depth information of and synthesizes an image from an arbitrary light source.
[18] In order to achieve this object, the present invention provides a photometric stereo method in a hybrid reflection model for synthesizing an image from an arbitrary light source, comprising: receiving N images having different positions of the light sources; Selecting all image pairs available for estimating image formation parameters from the images and repeatedly estimating image formation variables from them; Estimating the albido and normal for the diffuse reflection component that minimizes the diffuse reflection error from the selected image pairs; Detecting a diffuse reflection image from a normal vector and a diffuse reflection aldodo associated with the detected diffuse reflection component; Detecting a total reflection image from the diffuse reflection image and the entire image; Detecting an associated image forming parameter relating to total reflection from the total reflection image.
[1] 1 is a schematic block diagram of an apparatus for performing a photometric stereo method in a mixed reflection model according to the present invention;
[2] 2 is a flowchart of a photo metric stereo method in a mixed reflection model according to the present invention;
[3] 3 is a view showing an image shape model in a photometric stereo method in a mixed reflection model according to the present invention;
[4] 4 is a diagram showing an example of distribution of a normal vector for a specific pixel in the photometric stereo method in the hybrid reflection model according to the present invention;
[5] 5 is a graph showing an estimation error according to the number of input images obtained by experimenting with a photometric stereo method in a mixed reflection model according to the present invention;
[6] 6 is a view showing the influence of the total reflection component according to the surface roughness in the experiment of the photometric stereo method in the hybrid reflection model according to the present invention,
[7] 7 is a graph comparing relative estimation error according to surface roughness in the experiment of the photometric stereo method in the mixed reflection model according to the present invention;
[8] 8 is a view showing an image synthesis state for the Juliet gypsum in the experiment of the photometric stereo method in the hybrid reflection model according to the present invention;
[9] 9 shows a three-dimensional shape recovery state of the Juliet gypsum phase in the experiment of the photometric stereo method in the hybrid reflection model according to the present invention.
[10] <Description of the symbols for the main parts of the drawings>
[11] 1 controller 3 imager
[12] 5 light source 7 image processing apparatus
[13] 9: 3D object
[19] Hereinafter, an embodiment according to the present invention will be described in detail with reference to the accompanying drawings.
[20] 1 shows a schematic block diagram of an apparatus for carrying out the invention. As shown, the present invention includes an imager 3 and a light source 5 which are driven under the control of the controller 1. The controller 1 provides light to the same object while varying the position of the light source 5 as described below, and the imager 3 picks up the reflected light of the differently reflected object according to the position of the light source 5. .
[21] The reflected light, i.e., the image signal, picked up by the imager 3 is provided to the image processing apparatus 7, and the image processing apparatus 7 processes the image signal by the method of the present invention to be described later by using the three-dimensional object 9 (information). ).
[22] 2 shows a flowchart of the present invention performed by the image processing apparatus 7. Before explaining the operation flow of Figure 2 will be described below the basic concept used in the present invention.
[23] First, the present invention uses a hybrid reflection model that generalizes a Lambertian surface that is assumed by most SFS (shape from shading) algorithms. It is determined only by the relationship between the surface normal vector and the direction vector of the light source, irrespective of the viewpoint. However, since the reflection surface as well as the total reflection component exist on the actual surface, a new model that reflects them is required, and various hybrid reflection models have been proposed. In the present invention, a hybrid reflection model is used in which the diffuse reflection component is approximated as a Lambertian model and the total reflection component is approximated as a Torrance-sparrow model. An image forming model necessary to define a generalized hybrid reflection model is shown in FIG. 3.
[24] All vectors illustrated in FIG. 3 are unit vectors, Is the normal vector of the surface Is the line of sight vector from the surface of the object to the camera. And Is the direction vector of the light source, Denotes a total reflection direction vector as the midpoint vector of the gaze and the light source, and has a relation of Equation 1 below.
[25]
[26] On the other hand, the image brightness (radiance) L obtained by the imaging device can be expressed by the equation (2).
[27]
[28] here, Is the image brightness value due to diffuse reflection, Is the image brightness value by the first half, Albiddo due to diffuse reflection, Is the albido by total reflection, k is the parameter according to the surface material of the object, Is the total reflection direction vector ( ) And the normal vector of the surface ( Is the angle rad between
[29] As can be seen from Equation 2, in the hybrid surface reflection model, the image brightness value is a diffuse reflection component ( ) And total reflection component ( Are divided into In the present invention, the variable ( ) To determine the diffuse reflection surface and the total reflection surface. In the present invention, as will be described later, a diffuse reflection surface is obtained by first selecting an image that provides only a diffuse reflection component or ignores a total reflection component, and detects the total reflection surface superimposed thereon using the image.
[30] Next, an error measure of the present invention for measuring the accuracy of the estimated image forming variables will be described. The error measure E used in the present invention is defined in consideration of the brightness value of the N image, the hybrid reflection model used, and the mathematical stability of the estimate, which is defined as Equation 3 below.
[31]
[32] here Is the kth input image, The diffuse reflection image and the total reflection image of the kth input image, Represents each reconstructed image, Is a diffuse reflection error, Indicates a total reflection error. Also , The weights of the errors of the diffuse reflection and the total reflection are determined by reflecting the error or quantization error by the surface reflection model and the stability of the mathematically estimated value, and are defined by Equation 4.
[33]
[34] here , Is a weight value considering the estimation error and quantization error by the surface reflection model, Is a weight defined based on the mathematical stability of the image formation parameter estimation. Given the images obtained from the same camera as the surface reflection model, assuming that the quantization effect appears uniformly over the entire area, ) Has a constant value. On the other hand, weights ( ) Is a value that depends on the surface of the object and plays an important role in minimizing errors. According to the present invention, as described later, the weight (p) from the phase on the PQ map of the estimated image forming variables ).
[35] There are two ways to obtain the PQ map for the Lambertian surface from three input images. One method is to assume the albido of the surface of the object, and the other is to find the albedo as well as the PQ map when the albedo of the object surface is unknown. In the present invention, since the latter method is applied, this is briefly described and the mathematical stability is extracted accordingly.
[36] Image brightness value acquired by assuming Lambertian surface by default Is the same as Equation 5.
[37]
[38]
[39] here Is the brightness value of the i light source Is the unit position vector of the light source, Is the unit normal vector. If they are expressed in matrix form, they are represented by Equation 6.
[40]
[41] if, Ramen Equation 6 can be replaced with Equation 7, from which the normal vector ( ) Can be obtained.
[42]
[43] The normal vector found here ( ) Becomes
[44]
[45]
[46]
[47] Through the process of Equation 8 from Equation 8, the surface slope p, `` q not including three variables can be obtained, and the error scale E and the rabidido ρ can be obtained using this.
[48]
[49] On the PQ map of the estimated image forming variables, two important factors influence the mathematical stability. The first is the estimated stability according to the combination of the light source pairs. Error from each input image Suppose is given. At this time, equation (10) is obtained from equation (7).
[50]
[51] If you transform it, the error vector ( ) Is obtained by Equation 11.
[52]
[53] This error vector ( ) Can be calculated by Equation 12.
[54]
[55] Now the direction vector ( The determinant value of) ) Can be expressed as in Equation 13.
[56]
[57] Condition value ( ) Represents the condition of the light sources that determine the matrix. If the value is small, the position of the three light sources is linearly dependent. ) Is almost crowded in one place. Thus, the normal vectors of the object surface are biased in these directions and the correct normal direction cannot be obtained. Also, the smaller this value is, the more error vector (Equation 12) ) Increases in size, making it difficult to obtain accurate solutions and increasing errors. Like this Three light sources having a value of ) Below a certain value are referred to herein as ill-conditioned light source pairs. The unfavorable light source pairs do not always give an exact solution and are therefore always excluded from the estimation of image formation parameters.
[58] Now consider this factor and consider the weight ( ). A set of all lighting pairs that participate in determining the diffuse reflection component at pixel (x, y) Let's do it. From this the weight ( ) Is the same as Equation 14.
[59]
[60] Now, by substituting the weights defined in Equations 14 and 4 into Equation 3, an error measure E considering the weights can be obtained. By estimating image forming parameters that minimize this error measure E, desired object reflection characteristics and shape recovery can be achieved. In general, however, it is difficult to find an optimal solution of an error measure given by a nonlinear equation as in Equation 3. Therefore, the present invention proposes a method of optimizing step by step without finding a solution to optimize them globally.
[61] In the proposed invention, the error measure is minimized step by step in Equation 3 and iteratively updates the estimates of image forming variables. To do this, the diffuse reflection image is obtained from the input image, and the total reflection image is separated using the difference between the original image and the diffuse reflection image. In this process, the surface normal and diffuse reflection albido are estimated. Finally, the image forming variables related to the diffuse reflection image are obtained while separating the total reflection image, and the diffuse reflection error ( Minimize). Total reflection error from the total reflection images Estimate the remaining image forming parameters that minimize).
[62] Hereinafter, the present invention will be described in detail with reference to the flowchart of FIG. 2.
[63] The present invention is largely divided into two stages. The first is a diffuse reflection error ( Acquisition of diffuse reflection image forming parameters that minimize In this step, image forming parameters are minimized.
[64] The image processing apparatus 7 of the present invention first receives N images taken by the imager 3 (S1) and can be used for estimating image formation parameters therefrom, i.e., ill-conditioned light source pairs. All image pairs except for are selected and the image forming parameters are repeatedly estimated from them (S2).
[65] In the process of extracting an image formation variable, when N input images are given, all possible image pairs used for image formation parameter estimation are selected for each pixel, and image formation parameter estimation is repeatedly performed from them. To this end, the area of the surface of the target object is first separated from the input image. The shadow areas cannot be separated by the image alone because geometric information must be given, but the areas dominated by the total reflection effect can be separated only by the input image. Extraction of image formation parameters for pixels corresponding to this region is impossible, and image formation parameters are determined by interpolation from ambient values. Next, unfavorable light source pairs are excluded from all image pairs. Through this initial process, only some image pairs are selected among all image pairs that can be combined for each pixel.
[66] After performing step S2, the image processing apparatus 7 proceeds to step S3 to perform a process of estimating the albido and the normal. That is, the image processing apparatus may include (7) normal vectors of selected image pairs ( Not only separates the shadow area from the distribution of, but also the normal vector ( Extract) Assume that a specific pixel (x, y) is given. The normal vector obtained from image pairs for this pixel The mean vector of ), And variance ( Let's say (x, y)). If the variance value ( (x, y)) is the threshold Less than), the estimated mean normal vector ( ) Is assumed to be the surface normal vector for this pixel. If it is larger than the threshold, it is considered to be due to the shadow effect and the same process is repeated until it converges again except for vectors that are far from the mean. Where the threshold ( ) Considers sensor noise and shows the distribution, average, and threshold of normal vectors for a specific pixel in FIG. 4. 4a shows them on a unit sphere, and FIG. 4b shows them ( ) To the area.
[67] The normal vector thus estimated ( Weights given by (14) Calculate If this value is adequate, it means that the estimated value is correct. However, if this value is too large, remove the component that causes the large value and then re-normalize the normal vector (for that pixel). Calculate
[68] The image processing apparatus 7 performs the process of performing the step S3, for example, Equation 4, so that the minimum diffuse reflection error ( Steps S2 and S3 are detected until (S4), and in this process, normal vectors related to the diffuse reflection component ( ) And diffuse reflection albido ( ). The normal vector related to the diffuse reflection component detected by performing the above-described step (S3) ) And diffuse reflection albido ( ) From the diffuse reflection image ( ) Can be obtained (S5) and the original image ( ) And diffuse reflection ) To detect the difference ) Is obtained (S6).
[69]
[70] On the other hand, as shown in Equation 2 of the above-described hybrid reflection model, the total brightness image brightness value ( ) Is the same as the equation (16).
[71]
[72] Applying a natural logarithm to both sides of Eq. (16) yields a linear equation such as Eq. (17).
[73]
[74] here
[75] , , Means.
[76] The brightness value of the total reflection image obtained from Equation 16 ) And image forming variables related to total reflection components through LS (least square) algorithm for each pixel using (S7) In other words, A and B are known constants, so there are two unknown variables. Therefore, if you give A and B two pairs, Can be obtained. However, since more than two pairs are actually given, the optimal solution can be found using LS.
[77] After performing step S7 described above, the image processing apparatus 7 proceeds to step S8 and synthesizes the detected diffuse reflection image and the total reflection image, and provides three-dimensional object information. The three-dimensional object information will be information detected through the above-described process, it will be readily known to those skilled in the art that it can be provided as a single image using them.
[78] The present inventors have experimented with the present invention performing the above-described process, and the following describes the experimental results. In the experiment of the present invention, the error of coefficient estimation was measured while varying the number of input images with respect to the composite image to obtain a relationship between the number of input images and the coefficient estimation. For this purpose, a series of composite images were generated by varying the number of light sources from randomly generated spheres. The position of the light source was uniformly distributed so that the relative size of the total reflection component was constant for each image.
[79] Diffuse reflection albido (synthetic sphere) ) Was set to 200, and the value of parameter (k) was changed to 30, 50, 70 to determine the effect of surface roughness. From the estimated coefficients, the original image was synthesized to obtain a difference in brightness from the original image. In addition, the difference from the actual depth value was examined by restoring the 3D depth information, and the experimental results are shown in FIG. 5. FIG. 5A illustrates an error of brightness values of a resynthesized image, and FIG. 5B illustrates an error of reconstructed depth information. In FIG. 5B, a depth error means a relative error with respect to a radius of a synthesized sphere. As shown in FIG. 5, it can be seen that the estimated brightness value and depth information are more accurate as the number of input images increases regardless of surface roughness. Therefore, as claimed in the present invention, it can be seen that accurate estimation is possible by using more input images than using the existing four or six chapters. In addition, it can be seen that the albedo works stably even when the actual value is overestimated.
[80] The roughness of the surface is given by the parameter k in Equation 2. As can be seen in FIG. 6, when the parameter (k) value is large, the surface is smooth to form a bright surface and the total reflection component appears in a very small area. When the parameter (k) value is small, the surface is rough and total reflection component. It spreads evenly over the whole area.
[81] Eventually, the size of the total reflection lobe is determined by the parameter k, and the total reflection lobe has a great influence on estimating the diffuse reflection component. In this experiment, the estimation error was obtained by changing the parameter (k) for the synthesis sphere. Experimental results show that the smoother the surface, the more accurate coefficient estimation is possible. Another thing to note here is the relative magnitude difference between the estimated brightness value error and the depth information error. If the surface is rough, the error of the estimated brightness value is quite large, whereas the error of the estimated three-dimensional depth information is small. This is caused by the estimation error of diffuse reflection albedo as the total reflection component is spread over a large area on the surface of the object. If k = 10, the estimated albido is 220, on average, which is quite error-prone. In this case, the total reflection component evenly spread over a wide area may be estimated as an diffuse reflection component and reflected in the albido. However, even though the error of the albedo value is large, the actual brightness value error and the estimation error of the 3D depth information are relatively small. From this result, it can be seen that the proposed algorithm successfully recovers three-dimensional shape even if the estimated total aberration lobe is large. A graph of the total reflection albedo estimation error and the three-dimensional depth information estimation error according to the parameter k is shown in FIG. 7. In this case, the y-axis is a relative error, which is obtained by dividing the error by the true value albedo in the case of albedo estimation, and in the case of depth estimation, by dividing the error by the radius of the composite sphere.
[82] In order to confirm whether the proposed algorithm is robust for real images, we experimented with plaster images. Gypsum is an object with a relatively rough surface and is spread evenly over a large total reflection component. Estimated using 12 input images, two of them and a reconstructed image of each are shown in FIG. 8. In c and d of FIG. 8, portions having zero brightness values do not have effective lighting pairs, and coefficient estimation is not performed there, and thus no synthesized brightness values exist. Reconstructed three-dimensional information about Juliet gypsum image is shown in FIG. In this case, as shown in FIG. 9, the portion where the normal vector was not estimated was replaced with the surrounding value to restore the 3D shape. Therefore, the error on both sides of the nose affects the entire layer, but it can be confirmed that the surface of the face is well restored.
[83] As described above, the present invention provides a method for accurately estimating the inclination of the surface from the N- field image to which the hybrid reflection model can be applied. It is possible to provide an accurate three-dimensional image model.

权利要求:
Claims (4)
[1" claim-type="Currently amended] A photometric stereo method in a hybrid reflection model that synthesizes images from any light source,
Receiving N images having different positions of the light source;
Selecting all image pairs available for estimating image formation parameters from the images and repeatedly estimating image formation variables from them;
Estimating the albido and normal for the diffuse reflection component that minimizes the diffuse reflection error from the selected image pairs;
Detecting a diffuse reflection image from a normal vector and a diffuse reflection aldodo associated with the detected diffuse reflection component;
Detecting a total reflection image from the diffuse reflection image and the entire image;
Detecting a related image forming parameter relating to total reflection from the total reflection image.
[2" claim-type="Currently amended] The method of claim 1,
The image pairs usable for estimating image formation parameters from the images are all image pairs except an ill-conditioned light source pair, and the evil pair is a condition value ) Means a pair of images below the threshold,

In the above formula Is a photometric stereo method in a mixed reflection model, meaning a light source vector.
[3" claim-type="Currently amended] The method of claim 1 or 2, wherein minimizing the albido and the normals,
Detecting a mean vector and a variance value of the normal vectors obtained from the image pairs for the pixel;
If the variance is less than a certain threshold, the estimated average normal vector is assumed to be a normal vector for this pixel. If the variance is larger than the threshold, the average is detected again except for vectors that are far from the average vector to detect the normal vector. Making a step;
The weights defined based on the mathematical stability of the image formation parameter estimation from the detected normal vectors ) Re-detecting the normal vector if the value of the ) Is greater than or equal to a predetermined value.
[4" claim-type="Currently amended] The method of claim 3, wherein
The detecting of the total reflection image from the diffuse reflection image and the entire image comprises subtracting the diffuse reflection image from the whole image.

类似技术:

公开号 | 公开日 | 专利标题

US9420265B2|2016-08-16|Tracking poses of 3D camera using points and planes

CN102834845B|2016-10-26|The method and apparatus calibrated for many camera heads

Fusiello et al.1999|Improving feature tracking with robust statistics

Klein et al.2006|Full-3D Edge Tracking with a Particle Filter.

Audras et al.2011|Real-time dense appearance-based SLAM for RGB-D sensors

US7227973B2|2007-06-05|Device, method and record medium for image comparison

Jin et al.2001|Real-time feature tracking and outlier rejection with changes in illumination

Pollefeys et al.1998|Metric 3D surface reconstruction from uncalibrated image sequences

US5003166A|1991-03-26|Multidimensional range mapping with pattern projection and cross correlation

JP3151284B2|2001-04-03|Apparatus and method for salient pole contour grading extraction for sign recognition

Nayar et al.1990|Shape from focus: An effective approach for rough surfaces

US6670963B2|2003-12-30|Visual attention model

KR101075716B1|2011-10-21|Apparatus and method for inpainting image

DE10081029B4|2013-02-28|Image editing to prepare a textual analysis

US6173066B1|2001-01-09|Pose determination and tracking by matching 3D objects to a 2D sensor

EP2848003B1|2018-11-14|Method and apparatus for acquiring geometry of specular object based on depth sensor

US4825393A|1989-04-25|Position measuring method

EP2479726B9|2013-10-23|Image comparison system and image comparison method

US7711182B2|2010-05-04|Method and system for sensing 3D shapes of objects with specular and hybrid specular-diffuse surfaces

Wu et al.2011|High-quality shape from multi-view stereo and shading under general illumination

Bab-Hadiashar et al.1998|Robust optic flow computation

JP5671281B2|2015-02-18|Position / orientation measuring apparatus, control method and program for position / orientation measuring apparatus

US8326025B2|2012-12-04|Method for determining a depth map from images, device for determining a depth map

Hara et al.2005|Light source position and reflectance estimation from a single view without the distant illumination assumption

US6023530A|2000-02-08|Vector correlation system for automatically locating patterns in an image

同族专利:

公开号 | 公开日

KR100479983B1|2005-03-30|

引用文献:

公开号 | 申请日 | 公开日 | 申请人 | 专利标题

法律状态:
2001-09-14|Application filed by 박보건, 윤일동, 이경무, 이상욱

2001-09-14|Priority to KR10-2001-0056734A

2003-05-09|Publication of KR20030034274A

2005-03-30|Application granted

2005-03-30|Publication of KR100479983B1

优先权:

申请号 | 申请日 | 专利标题

KR10-2001-0056734A|KR100479983B1|2001-09-14|2001-09-14|Generalized photometric stereo for the hybrid reflectance model using an image sequences|

[返回顶部]